Using LogitBoost classifier to predict protein structural classes.
Abstract
Prediction of protein structural classes is an important topic in molecular biology: it not only provides useful information about the structure itself, but also greatly stimulates the characterization of many other protein features that may be closely correlated with biological function. In this paper, LogitBoost, one of the recently developed boosting algorithms, is introduced for predicting protein structural classes. It performs classification using a regression scheme as the base learner, can handle multi-class problems, and copes particularly well with noisy data. LogitBoost outperformed support vector machines in predicting the structural classes for a given dataset, indicating that the new classifier is very promising. The power to predict protein structural classes, as well as many other bio-macromolecular attributes, is anticipated to be further strengthened if LogitBoost and other existing algorithms can effectively complement each other.
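To make the "regression scheme as the base learner" idea concrete, here is a minimal sketch of two-class LogitBoost with weighted least-squares regression stumps. It is an illustration under stated assumptions, not the setup used in the paper: the paper addresses the multi-class case, whereas this sketch is binary only; the function names, the toy one-feature dataset, and the clipping of the working response to [-4, 4] are all choices made here for illustration.

```python
import math


def stump_predict(stump, row):
    """Evaluate a regression stump (feature index, threshold, left value, right value)."""
    j, t, left, right = stump
    return left if row[j] <= t else right


def fit_stump(X, z, w):
    """Weighted least-squares fit of a one-split regression stump to the responses z."""
    n, d = len(X), len(X[0])
    # Fallback: a constant stump (threshold = +inf sends every row to the left).
    mean = sum(wi * zi for wi, zi in zip(w, z)) / sum(w)
    err = sum(wi * (zi - mean) ** 2 for wi, zi in zip(w, z))
    best = (err, 0, math.inf, mean, mean)
    for j in range(d):
        values = sorted(set(row[j] for row in X))
        # Candidate thresholds: midpoints between consecutive distinct values.
        for a, b in zip(values, values[1:]):
            t = (a + b) / 2.0
            wl = wr = zl = zr = 0.0
            for i in range(n):
                if X[i][j] <= t:
                    wl += w[i]
                    zl += w[i] * z[i]
                else:
                    wr += w[i]
                    zr += w[i] * z[i]
            left, right = zl / wl, zr / wr
            err = sum(
                w[i] * (z[i] - (left if X[i][j] <= t else right)) ** 2
                for i in range(n)
            )
            if err < best[0]:
                best = (err, j, t, left, right)
    return best[1:]


def logitboost_fit(X, y, rounds=20):
    """Two-class LogitBoost; y holds labels in {0, 1}. Returns the list of fitted stumps."""
    F = [0.0] * len(X)  # additive model evaluated on the training rows
    stumps = []
    for _ in range(rounds):
        # Class-1 probability implied by the current additive model.
        p = [1.0 / (1.0 + math.exp(-2.0 * f)) for f in F]
        # Weights, floored to avoid division by zero (assumption of this sketch).
        w = [max(pi * (1.0 - pi), 1e-10) for pi in p]
        # Working response, clipped to [-4, 4] for stability (assumption of this sketch).
        z = [max(-4.0, min(4.0, (yi - pi) / wi)) for yi, pi, wi in zip(y, p, w)]
        stump = fit_stump(X, z, w)
        stumps.append(stump)
        F = [f + 0.5 * stump_predict(stump, row) for f, row in zip(F, X)]
    return stumps


def logitboost_predict(stumps, row):
    """Return 1 if the summed additive model is positive, else 0."""
    F = sum(0.5 * stump_predict(s, row) for s in stumps)
    return 1 if F > 0 else 0


# Hypothetical toy data: one numeric feature per sample, two classes.
X = [[0.1], [0.2], [0.3], [0.7], [0.8], [0.9]]
y = [0, 0, 0, 1, 1, 1]
model = logitboost_fit(X, y, rounds=10)
print([logitboost_predict(model, row) for row in X])
```

Each round fits the stump to a reweighted working response rather than to the raw labels, which is what distinguishes this scheme from boosting variants that reweight misclassified samples directly. A real protein feature vector (for example, amino acid composition) would replace the single toy feature.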
Related articles
Boosting classifier for predicting protein domain structural class.
A novel classifier, the so-called "LogitBoost" classifier, was introduced to predict the structural class of a protein domain according to its amino acid sequence. LogitBoost is characterized by a log-likelihood loss function that reduces sensitivity to noise and outliers, as well as by performing classification via combining many weak classifiers together to build up a very strong and ...
Using Bagging classifier to predict protein domain structural class.
Classification and prediction of protein domain structural class is one of the important topics in molecular biology. We introduce Bagging (bootstrap aggregating), one of the bootstrap methods, for classifying and predicting protein structural classes. By a bootstrap aggregating procedure, Bagging can improve a weak classifier, for instance the random tree method, to a significant s...
Students' Performance Prediction System Using Multi Agent Data Mining Technique
High accuracy in predicting students' performance helps to identify low-performing students at the beginning of the learning process. Data mining is used to attain this objective. Data mining techniques are used to discover models or patterns in data, and they are very helpful in decision-making. Boosting is one of the most popular techniques for constructing ensembles ...
An Artificial Neural Network Classifier for the Prediction of Protein Structural Classes
Because predicting a protein structural class directly from its primary sequence poses quite a few difficulties, prediction based on the predicted secondary structure is the natural first choice. Protein structural classes are generally defined as four classes: α, β, α/β, α+β. The protein secondary structure describes the local struct...
Struct-NB: predicting protein-RNA binding sites using structural features
We analyse sequence and structural features of protein-RNA interfaces using RB-147, a non-redundant dataset of protein-RNA complexes extracted from the PDB. We train classifiers using machine learning algorithms to predict protein-RNA interfaces from sequence and structure-derived features of proteins. Our experiments show that Struct-NB, a Naive Bayes classifier that exploits structural featur...
Journal:
- Journal of Theoretical Biology
Volume 238, Issue 1
Pages: -
Publication date: 2006